Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Aug 6 changes #16

Closed
wants to merge 7 commits into from
Closed

Aug 6 changes #16

wants to merge 7 commits into from

Conversation

nathanjzhao
Copy link
Contributor

Get nans in scores after close to standing up (when there are nans, the episode count is very long, meaning there's no failures --- pretty sure it's standing up). Nans makes it not possible to see recording + make it impossible to see further actor/critic convergence though, so unable to fully tell.

Probably an infra issue somewhere but Pawel's training some stuff right now so we'll maybe get to see later.

nathanjzhao and others added 7 commits August 5, 2024 19:08
…use i literally just increased size of actor/critic models. maybe should do some ablation tests. or get working fully first! (better reward func)
Got nans in training and trying to fix by norm'ing obs and also
adding epsilon to action_log_pdf. Also some cleaning
* added tricks from blogpost

* reorder memory

* shown trying to stand up + increasing score? not sure if this is because i literally just increased size of actor/critic models. maybe should do some ablation tests. or get working fully first! (better reward func)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant